ARTICLE INFO

Keywords: Coronavirus; Replication; cis-acting RNA element

ABSTRACT

Structure predictions suggest a partial conservation of RNA structure
elements in coronavirus terminal genome regions. Here, we determined the
structures of stem-loops (SL) 1 and 2 of two alphacoronaviruses, human
coronavirus (HCoV) 229E and NL63, by RNA structure probing and studied
the functional relevance of these putative cis-acting elements.
HCoV-229E SL1 and SL2 mutants generated by reverse genetics were used to
study the effects on viral replication of single-nucleotide substitutions
predicted to destabilize the SL1 and SL2 structures. The data provide
conclusive evidence for the critical role of SL1 and SL2 in HCoV-229E
replication and, in some cases, revealed parallels with previously
characterized betacoronavirus SL1 and SL2 elements. Also, we were able to
rescue viable HCoV-229E mutants carrying replacements of SL2 with
equivalent betacoronavirus structural elements. The data obtained in this
study reveal a remarkable degree of structural and functional
conservation of 5’-terminal RNA structural elements across coronavirus
genus boundaries.


1. Introduction 


Cis-acting RNA elements play important roles in the life cycle of 
plus-strand (+) RNA viruses, including RNA replication, viral gene 
expression and genome packaging (Barton et al., 2001; Liu et al., 
2009b; Firth and Brierley, 2012; Goto et al., 2013; Kuo and Masters, 
2013; Morales et al., 2013; Nicholson and White, 2014; Keane et al., 
2015). Compared to many other +RNA viruses, information on cis- 
acting RNA elements of coronaviruses, including their specific func- 
tions, structures and interactions, remains limited. Particularly, this 
applies to viruses from genera outside the genus Betacoronavirus (for 
reviews, see Brian and Baric, 2005; Masters, 2007; Liu and Leibowitz, 
2010; Madhugiri et al., 2014; Yang and Leibowitz, 2015; Madhugiri 
et al., 2016). Historically, RNA structures and sequences required for 
(beta)coronavirus RNA synthesis were characterized using defective 
interfering (DI) RNA-based systems (Chang et al., 1994, 1996; Raman 
et al., 2003; Raman and Brian, 2005; Brown et al., 2007; Gustin et al., 
2009). Thus, for example, RNA structure probing studies of mouse 
hepatitis virus (MHV) and bovine coronavirus (BCoV)-derived RNAs led 
to the identification of up to four stem-loops within the 5’-terminal 215 
nt of the genome (for recent reviews, see Liu and Leibowitz, 2010;
Madhugiri et al., 2014, 2016; Yang and Leibowitz, 2015). In many
cases, potential functional roles of RNA structural elements present in 
the 5'-terminal genome region could be confirmed by mutational ana- 
lyses. More recently, genus- and subfamily-wide RNA structure-based 
alignments using all currently approved coronavirus species in the re- 
spective genera of the Coronavirinae were performed for this highly 
divergent genome region. The studies led to a model of three highly 
conserved stem-loop structures, called SL1, SL2, and SL4, in the 5’ 
terminal, ~150-nt genome region (Kang et al., 2006; Liu et al., 2007; 
Chen and Olsthoorn, 2010; Madhugiri et al., 2014). Furthermore, nu- 
clear magnetic resonance (NMR) spectroscopy provided structural 
support for SL1 and SL2 in three betacoronaviruses, MHV, BCoV, and 
HCoV-OC43 (Liu et al., 2007, 2009a; Li et al., 2008). Also, a selective
2’-hydroxyl acylation and primer extension (SHAPE) analysis in virio
and ex virio confirmed the predicted SL1, SL2, and SL4 structures for
MHV-A59 (Yang et al., 2015).

Possible biological functions of betacoronavirus 5’-terminal SL1 and 
SL2 structures in viral replication could be substantiated by reverse 
genetics studies (Kang et al., 2006; Liu et al., 2007, 2009a; Li et al., 
2008). For example, an MHV study revealed that destabilization of the 
upper part of SL1 produces viruses with replication defects, while
compensatory mutations restoring these base-pairing interactions led to
viruses with near-wildtype growth kinetics (Li et al., 2008). In contrast, 
disruption of the basal part of SL1 was largely tolerated, while com- 
pensatory mutations that restored these base-pairing interactions 
proved to be lethal, suggesting a critical role for the RNA sequence 
(rather than structure) in this lower part of SL1. Based on these and 
other data, SL1 was suggested to require an optimal stability suitable to 
establish transient long-range (RNA- and/or protein-mediated) inter- 
actions between the 5’- and 3’-UTRs that may be required for genome 
replication and subgenomic (sg) mRNA synthesis. Other reverse ge- 
netics studies confirmed that the 5’-terminal SL2 is also required for 
MHV RNA synthesis (Liu et al., 2007, 2009a). Based on phylogenetic 
analyses, the SL2 was proposed to be the most conserved RNA sec- 
ondary structure in coronaviruses (Kang et al., 2006; Liu et al., 2007; 
Chen and Olsthoorn, 2010). It is composed of a 5-bp stem and a con- 
served loop sequence, 5’-CUUGY-3’, that was shown to adopt a 5’- 
uCUYG(U)a-3'- or a 5'-uYNMG(U)a-3’-like tetraloop structure (Liu et al., 
2009a). 

To extend these studies and corroborate predictions on alphacor- 
onavirus-associated 5'-terminal RNA structural elements, we used a 
combination of bioinformatics, biochemical and reverse genetics ap- 
proaches, focusing on structures and functions of the 5’-terminal SL1 
and SL2 structures in the HCoV-229E genome (genus Alphacoronavirus). 
The data obtained in this study provide evidence for the existence of 
two SL structures (SL1 and SL2) in the ~80-nt, 5’-terminal HCoV-229E
and HCoV-NL63 genome regions. The structures were found to be re- 
quired for viral replication and appear to be (largely) conserved be- 
tween alpha- and betacoronaviruses. Thus, for example, we were able 
to show that the HCoV-229E SL2 structure can be replaced with that of 
the betacoronaviruses BCoV and SARS-CoV, respectively, providing 
experimental support for our previous hypothesis that (some) RNA 
structural elements in coronavirus untranslated genome regions may be 
more conserved than previously thought, even across genus boundaries 
(Madhugiri et al., 2014). 


2. Material and methods 
2.1. Cells and viruses 


Wildtype HCoV-229E and HCoV-229E mutants were propagated in 
Huh-7 cells. HCoV-229E titers were determined by plaque assay using 
Huh-7 cells. Recombinant vaccinia viruses were propagated in CV-1 and 
BHK-21 cells, and plaque purifications of single virus clones were per- 
formed using CV-1 and D980R cells as described previously (Isaacs 
et al., 1990; Thiel et al., 2001). 


2.2. Mutagenesis of the HCoV-229E full-length cDNA clone 


HCoV-229E mutants (HCoV-229E_C11G, _C16G, _G45C, _C47G,
_C11G-G34C, _C16G-G29C, _G45C-C55G, and _C47G-G53C) were generated
using the recombinant vaccinia virus vHCoV-inf-1, which contains
a full-length HCoV-229E cDNA (GenBank accession number NC_002645).
Site-directed mutagenesis of the HCoV-229E cDNA insert in
vHCoV-inf-1 was done using previously described methods (Thiel et al.,
2001). To construct vHCoV-inf-1 derivatives containing nucleotide
substitutions in the HCoV-229E 5’-UTR, we used the plasmid pBS-5’GPT
for recombination with vaccinia virus vHCoV-inf-1. This
pBluescriptII-derived plasmid was constructed to contain the E. coli gpt
gene flanked by (i) a 500-bp fragment representing the vaccinia DNA
sequence located upstream of the HCoV-229E cDNA insert in vHCoV-inf-1
and (ii) a 500-bp fragment representing the cDNA sequence of nts
1001-1500 of the HCoV-229E genome RNA. Next, a gpt-positive vHCoV-inf-1
derivative, called vRec-5’GPT, was selected from CV-1 cells infected with
vHCoV-inf-1 and transfected with pBS-5’ plasmid DNA. In a second
selection step, D980R cells were infected with vRec-5’GPT and
transfected with an appropriate pBS-5’UTR-mut plasmid DNA. Using
appropriate selection conditions (Hertzig et al., 2004), gpt-negative
vHCoV-inf-1 derivatives (called vHCoV_5'UTR-mut) that contained the
desired mutation(s) in the 5' UTR cDNA sequence were isolated. The
pBS-5’UTR-mut plasmid constructs used to produce the recombinant
vHCoV_5'UTR-mut vaccinia viruses contained the 500-bp vaccinia virus
sequence described above followed by a cDNA copy of HCoV-229E nts
1-1500, with appropriate mutations being introduced by PCR-based
mutagenesis. Sequences of vHCoV_5'UTR-mut vaccinia virus constructs
were verified by Southern blotting and sequence analysis as described
(Thiel et al., 2001). The vHCoV-inf-1 derivatives generated in this study
were called vHCoV_5’UTR-C11G, vHCoV_5’UTR-C16G, vHCoV_5’UTR-G45C,
vHCoV_5’UTR-C47G, vHCoV_5’UTR-C11G+G34C, vHCoV_5’UTR-C16G+G29C,
vHCoV_5’UTR-G45C+C55G, and vHCoV_5’UTR-C47G+G53C. Genome-length
HCoV-229E RNAs were prepared by T7-based in vitro transcription
(RiboMAX Large Scale RNA Production System, Promega) using purified
genomic DNA from vHCoV-inf-1 and its mutant derivatives, respectively.
1.25 µg of in vitro-transcribed genome-length RNAs and 0.75 µg of in
vitro-transcribed HCoV-229E nucleocapsid (N) protein mRNA (Schelle et
al., 2005; Almazan et al., 2006) were used to transfect 1 x 10° Huh-7
cells using the TransIT® mRNA transfection kit according to the
manufacturer's instructions (Mirus Bio LLC). At 72 h posttransfection
(p.t.), cell culture supernatants were collected to determine viral
titers, and total RNA was isolated for subsequent Northern blot and
genome sequence analyses.


2.3. RNA extraction and Northern blot analysis 


At 72 h p.t., intracellular RNA was extracted using TRIzol reagent
(Invitrogen) according to the manufacturer's instructions. To analyze
viral RNAs by Northern blot hybridization, 10 µg total RNA was
denatured for 10 min at 65 °C in loading buffer (50% deionized
formamide, 18% formaldehyde, 1x MOPS) and separated in a 1% (w/v)
agarose and 2.2 M formaldehyde-containing, 1x MOPS-buffered gel at
16 V for 16-17 h. The gel was soaked in buffer A (50 mM NaOH,
150 mM NaCl) for 30 min and then in buffer B (100 mM Tris-HCl, pH
7.5, 150 mM NaCl) for 30 min. Next, the RNA was transferred onto a
positively charged nylon membrane by vacuum blotting. The RNA was
cross-linked to the membrane and hybridized with an [α-32P]dCTP-labeled
DNA probe specific for HCoV-229E nucleotides 26857-27277 and
the negative-strand complement of this sequence (TaKaRa Bio Inc).
Following hybridization, membranes were rinsed 2 times with 2x SSC/
0.01% (w/v) SDS at room temperature and 2 times with 0.2x SSC/
0.01% (w/v) SDS at 55 °C for 30 min. Hybridization signals were
visualized by autoradiography using a Typhoon 9200 imager (GE
Healthcare).


2.4. Genome sequence analysis of virus progeny 


At 72 h p.t., cell culture supernatants were collected (passage zero
[p0]) and used to determine virus titers and plaque sizes (see below). 
From the cell pellet, total RNA was extracted using TRIzol reagent 
(Invitrogen). Following reverse transcription (RT)-PCR amplification, 
the 5' and 3'-terminal HCoV-229E genome regions (nts 25-750 and nts 
25323-27317, respectively) were sequenced. The following primer 
pairs were used to produce two amplicons for subsequent sequence 
analyses: (1) HCoV-229E-25up (5’-ACTTAAGTACCTTATCTATCTA 
CAG-3’) and HCoV-229E-750dn (5’-GAAATTATCATCAATGGTCATACT 
TAC-3’) and (2) HCoV-229E-25323up (5’-CATGGAATCCTGAGGTTAA 
TGCAATC-3’) and HCoV-229E-oligo(dT) (5’-TTTTTTTTTTGTGTATCC 
ATATCG-3’). To determine the 5’-terminal nts 1-25 of progeny virus 
genomes, the FirstChoice™ RLM-RACE kit was used according to the 
manufacturer's instructions (Invitrogen). PCR products used for se- 
quence analyses were gel purified (innuPREP Gel Extraction Kit, 
Analytik Jena) and subjected to automated Sanger sequencing (LGC 
Genomics). The 5’-UTR and 3'-UTR amplicons were sequenced using 
oligonucleotides HCoV-229E-750dn and HCoV-229E-27317dn,
respectively.


2.5. Virus titration


Virus titers in the supernatants of cells transfected with the appro- 
priate full-length HCoV-229E RNA (see above) were determined as 
follows. Nearly confluent monolayers of Huh-7 cells that were grown in 
96-well plates using DMEM (supplemented with 10% fetal bovine 
serum [FBS] and antibiotics) were inoculated with 100 µl per well of a
serial logarithmic dilution of cell culture supernatants obtained from
transfected cells. Following incubation for 5-6 d at 33 °C, titers of
infectious virus progeny (given as 50% tissue culture infectious dose
[TCID50] per ml) were determined using the method described by Reed
and Muench (Reed and Muench, 1938).
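
The Reed and Muench endpoint calculation can be illustrated with a minimal Python sketch. The function name, the number of wells per dilution, and the example well counts are assumptions made for illustration only and are not taken from this study; only the inoculum volume (100 µl, i.e. 0.1 ml per well) follows the protocol described above.

```python
import math


def reed_muench_tcid50_per_ml(log10_dilutions, infected, wells_per_dilution,
                              inoculum_ml=0.1):
    """Estimate TCID50/ml by the Reed and Muench (1938) method.

    log10_dilutions: log10 of each dilution, ordered from least to most
        dilute (e.g. [-1, -2, -3]).
    infected: number of CPE-positive wells at each dilution.
    wells_per_dilution: wells inoculated per dilution (hypothetical value).
    inoculum_ml: volume per well in ml (0.1 ml, as in the text above).
    """
    uninfected = [wells_per_dilution - n for n in infected]
    # Cumulative infected wells are summed from the most dilute sample upward;
    # cumulative uninfected wells are summed from the least dilute downward.
    cum_inf = [sum(infected[i:]) for i in range(len(infected))]
    cum_uninf = [sum(uninfected[:i + 1]) for i in range(len(infected))]
    pct = [100.0 * ci / (ci + cu) for ci, cu in zip(cum_inf, cum_uninf)]

    for i in range(len(pct) - 1):
        if pct[i] >= 50.0 > pct[i + 1]:
            # Proportionate distance between the two bracketing dilutions.
            pd = (pct[i] - 50.0) / (pct[i] - pct[i + 1])
            step = log10_dilutions[i] - log10_dilutions[i + 1]
            log10_endpoint = log10_dilutions[i] - pd * step
            return 10 ** (-log10_endpoint) / inoculum_ml
    raise ValueError("the 50% endpoint is not bracketed by the dilution series")


# Hypothetical plate: 8 wells per dilution, 10-fold steps from 10^-1 to 10^-5.
titer = reed_muench_tcid50_per_ml([-1, -2, -3, -4, -5], [8, 8, 6, 2, 0], 8)
print(f"{math.log10(titer):.2f} log10 TCID50/ml")
```

With these made-up counts, the 50% endpoint falls halfway between the 10^-3 and 10^-4 dilutions, and dividing by the inoculum volume converts the endpoint to a titer per ml.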

Virus plaque assays were performed using confluent Huh-7 cells that 
were grown in 6-well plates. Cell monolayers were inoculated with a 
10-fold serial dilution of virus-containing culture supernatants. At 1 h
p.i., the inoculum was removed. Cells were washed with PBS and
overlaid with 2 ml MEM containing 10% FBS, 1.25% Avicel (Sigma),
and antibiotics. At 4 days p.i., the medium was removed and cell 
monolayers were stained with 0.1% crystal violet solution to visualize 
virus plaques. 


3. In vitro RNA structure probing 
3.1. In vitro transcription using T7 RNA polymerase 


Typically, 1 µg of PCR product representing the 5'-terminal ~100
nts of the HCoV-229E and HCoV-NL63 genome, respectively, was used
as template in in vitro transcription reactions. The reactions were
performed using the T7 RiboMAX™ express large scale RNA production
system (Promega) according to the manufacturer's instructions. DNA
templates were digested using 1 U of RNase-free RQ1 DNase (Promega).
Free nucleotides were removed using G25 microspin columns (GE
Healthcare). The RNA was purified by phenol-chloroform-isoamyl
alcohol (Roth) extraction and precipitated.


3.2. RNA structure probing 


RNA structure probing experiments were done as described previously
(Ehresmann et al., 1987; Luo et al., 1998) with minor modifications.
Typically, 0.6 µg of in vitro-synthesized RNA was heat-denatured
at 90 °C for 1 min and then cooled on ice for 5 min. The RNA
was renatured in AN buffer (50 mM sodium cacodylate, pH 7.5, 5 mM
MgCl2, 60 mM KCl) for 20 min at room temperature. Next, the samples
(total volume of 8 µl) were mixed with 1 µl of yeast tRNA (2 mg/ml,
Ambion) and 1 µl of dimethyl sulfate (DMS, Aldrich, #D186309)
solution (diluted to 1/2, 1/5, 1/10 and 1/20, respectively, in 20%
ethanol). Control reactions were done under equal conditions in the 
absence of DMS. Following incubation for 5 min at room temperature, 
the reactions were terminated by ethanol precipitation in the presence 
of 1/10 vol 3 M sodium acetate (pH 5.2). DMS modifications of specific 
nucleotides were determined by primer extension analysis. 
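
Because DMS modification is read out here at unpaired adenosines and cytidines, the positions expected to react can be listed directly from a sequence and a secondary structure model. The following Python sketch illustrates this; the function name and the toy hairpin are hypothetical and do not represent the HCoV-229E or HCoV-NL63 sequence.

```python
def expected_dms_sites(sequence, dot_bracket):
    """Return 1-based positions of unpaired A and C residues.

    sequence: RNA sequence (A, C, G, U).
    dot_bracket: secondary structure in dot-bracket notation, where '.'
        marks an unpaired nucleotide and '(' / ')' mark paired ones.
    """
    if len(sequence) != len(dot_bracket):
        raise ValueError("sequence and structure must have the same length")
    return [
        position
        for position, (nt, state) in enumerate(
            zip(sequence.upper(), dot_bracket), start=1)
        if state == "." and nt in "AC"
    ]


# Toy hairpin (hypothetical): 4-bp stem, 5-nt loop, one unpaired 3' adenosine.
toy_sequence = "GGACCUUGUGUCCA"
toy_structure = "((((.....))))."
print(expected_dms_sites(toy_sequence, toy_structure))
```

In this toy example, only the loop cytidine and the unpaired adenosine flanking the stem are listed; paired A and C residues in the stem are excluded, as are all G and U residues.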


3.3. Primer extension assay 


To analyze DMS modifications, reverse transcription reactions were 
performed using one of the following oligonucleotides: 229E-1 
(5’-CGACTCCAGCATCAAAGATGC-3’; complementary to nts 93-113 of 
the HCoV-229E 5'-UTR) and NL63-1 (5’-CGAAATTTCAATTACACTAG 
GAC-3’; complementary to nts 107-129 of the HCoV-NL63 5'-UTR). 
Aliquots of chemically modified RNAs (3 pmol) were hybridized with 
1-3 pmol of 5’-end labeled primer (1-2 x 10° dpm). Following a brief 
heating step (90 °C, 2 min), the reaction was cooled slowly (5 min at
75 °C, 10 min at 50 °C, 5 min at 37 °C, 10 min at room temperature).
Next, the primer annealing mixture was used to set up a 20-µl reverse
transcription reaction in 1x SuperScript® III RTase reaction buffer
supplemented with 170 units of SuperScript® III RTase (Invitrogen), 20
units RNaseOUT (Invitrogen), and 1 mM of each dNTP. The reaction
was performed at 42 °C for 50 min and then at 55 °C for 60 min.
Reactions were terminated by the addition of 1/10 vol of 3 M sodium
acetate, pH 5.2, and 10 volumes of ice-cold ethanol. Following
centrifugation, the pellets were washed with 70% ethanol. The dried
pellets were resuspended in water and treated with DNase-free RNase A for
20 min at 37 °C (0.2 mg/ml, Invitrogen). Next, PCR-grade Proteinase K
(Invitrogen) was added to a final concentration of 1 mg/ml and the
reaction was incubated for another 15 min at 55 °C. Reactions were
stopped by adding Fu-mix (6 M urea, 80% deionized formamide, 1x
TBE, 0.1% (w/v) Bromophenol blue, and 0.1% (w/v) Xylene cyanol).
Reaction products were separated in TBE-buffered 8% polyacrylamide 
gels containing 7 M urea. Signals were visualized using a Typhoon 9200 
imager (GE Healthcare) and analyzed using Quantity One software 
(BioRad). 
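
The primer coordinates given at the beginning of this section can be sanity-checked with a short Python sketch. It confirms that each primer length matches the stated coordinate range and prints the genome-sense sequence to which the primer should anneal. The helper names are hypothetical; the primer sequences and coordinates are those listed above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primer name -> (sequence 5'->3', first nt, last nt of the annealing site)
PRIMERS = {
    "229E-1": ("CGACTCCAGCATCAAAGATGC", 93, 113),
    "NL63-1": ("CGAAATTTCAATTACACTAGGAC", 107, 129),
}


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def check_primer(name):
    """Return (length matches coordinates, genome-sense target sequence)."""
    sequence, start, end = PRIMERS[name]
    return len(sequence) == end - start + 1, reverse_complement(sequence)


for primer_name in PRIMERS:
    length_ok, target = check_primer(primer_name)
    print(primer_name, length_ok, target)
```

The printed target sequence is in DNA notation (T instead of U) and can be compared against the cDNA sequence of the respective 5'-UTR.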


3.4. Bioinformatic analyses 


RNA secondary structures were calculated using RNAfold, version 
2.4.1 (Lorenz et al., 2011). To calculate base-pairing probabilities,
parameters were set to --noLP and -p. RNA secondary structures were
visualized using VARNA (version 3.93) (Darty et al., 2009). The color
codes used in Figs. 1, 2, 5B, and 8 indicate base-pairing probabilities
derived from dot plots generated by RNAfold (see Suppl. Figures 1 and
2). Structure-based alignments were calculated with LocARNA, version
1.8.11 (Will et al., 2012). Consensus secondary structures were
calculated with RNAalifold --noLP --color -r -p (version 2.4.1)
(Lorenz et al., 2011). Herein, the color code represents the numbers of
different base-pairing types and numbers of incompatible bases,
respectively. Sequence conservation was visualized using WebLogo 3.5.0
(Fig. 5C) (Crooks et al., 2004).
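
A minimal Python sketch of how RNAfold could be called with the two options named above is given below. It assumes that the RNAfold executable from the ViennaRNA package is installed and on the PATH, that the sequence is passed on standard input, and that the second output line holds the minimum free energy structure followed by its energy in parentheses. The wrapper functions and the example line are illustrative and are not the scripts used in this study.

```python
import subprocess


def rnafold_command():
    """Command line with the options stated in the text (--noLP and -p)."""
    return ["RNAfold", "--noLP", "-p"]


def parse_mfe_line(line):
    """Split an RNAfold structure line into (dot-bracket, energy in kcal/mol)."""
    structure, energy = line.strip().split(None, 1)
    return structure, float(energy.strip("() "))


def fold(sequence):
    """Run RNAfold on one sequence and return its MFE structure and energy.

    Assumes RNAfold is installed. With -p, RNAfold also writes a dot plot
    file into the current working directory.
    """
    result = subprocess.run(
        rnafold_command(),
        input=sequence + "\n",
        capture_output=True,
        text=True,
        check=True,
    )
    lines = result.stdout.splitlines()
    # Line 0 echoes the sequence; line 1 holds the MFE structure and energy.
    return parse_mfe_line(lines[1])


# Parsing a made-up output line (the energy value is illustrative only).
print(parse_mfe_line("((((.....)))) ( -1.20)"))
```

Keeping the parser separate from the subprocess call allows the parsing step to be checked without a local ViennaRNA installation.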


4. Results 


4.1. RNA structure probing analysis of alphacoronavirus 5'-terminal 
genome regions 


To provide experimental support for our RNA structure model of 
alphacoronavirus 5'-terminal genome regions (Madhugiri et al., 2014) 
(Suppl. Figure 1), we performed a series of RNA structure probing ex- 
periments. RNA transcripts representing the 5’-terminal ~100 nt of the 
HCoV-229E and HCoV-NL63 genome, respectively, were produced in 
vitro and treated with the methylating agent DMS (Ehresmann et al., 
1987). N1 methylation of unpaired adenosines and N3 methylation of
unpaired cytidines were identified by primer extension analysis using
reverse transcriptase. 5'-[32P]-labeled products obtained in these
reactions were separated in denaturing polyacrylamide gels and visualized
by phosphorimaging (see Materials and Methods). RNA that was not 
treated with DMS was included as a control to detect potential non- 
specific termination products of the reverse transcriptase reaction. 
Autoradiograms that are representative of an extensive set of DMS 
structure probing experiments and a summary of the structure probing 
information obtained in these experiments are shown in Fig. 1. Our 
RNA secondary structure predictions (Madhugiri et al., 2014) (Suppl. 
Figure 1) and the in vitro DMS structure probing data obtained in this 
study (Fig. 1) lead us to propose a model in which the 5’-terminal 80-nt 
regions of the HCoV-229E and NL63 genome RNAs fold into two con- 
served stem-loops (SL), called SL1 and SL2, while the adjacent 3' region 
containing the leader-associated transcription regulatory sequence 
(TRS-L) (Zuniga et al., 2004) does not adopt a stable structure. The SL1 
structures of HCoV-229E and NL63 appear to be fairly stable as none of 
the principal nucleotides forming the predicted SL1 stem structure were 
accessible to DMS modification, while several nucleotides predicted to 
be part of bulge or loop regions were modified by DMS (Fig. 1B and D).

Previous analyses suggested that (beta)coronavirus 5’-SL2 elements
are made up of a 5-bp stem and a pentaloop (Kang et al., 2006). The
DMS structure probing data presented in this study suggest that the SL2
elements of both HCoV-229E and HCoV-NL63 are composed of a 4-bp
(rather than 5-bp) stem and a pentaloop sequence (Fig. 1). Thus, for
both viruses, A57 (positioned at the base of SL2) was regularly found to
be modified by DMS while none of the other adenosine and cytidine
residues predicted to be part of the SL2 structures were found to be
accessible to DMS modification. Base-pairing probabilities calculated
with RNAfold further indicate that the basal part of HCoV-229E SL2
may have a certain degree of flexibility (Figs. 1 and 2).

Fig. 1. RNA structure probing analysis of 5’-terminal 80-nt genome regions of HCoV-229E
and HCoV-NL63. A and C, Chemical probing of in vitro-transcribed RNAs representing the
5’-terminal 80 nts of the genome RNAs of HCoV-229E and HCoV-NL63. RNAs were
modified with DMS in the presence of Mg2+ and K+ ions (for details, see Material and
Methods). DMS modifications of unpaired adenosines and cytidines were identified by
primer extension analysis. Shown are the autoradiograms of representative denaturing
polyacrylamide gels. Lanes: -, reaction performed in the absence of DMS; 1:2, 1:5, 1:10
and 1:20, DMS dilutions used in the respective reactions; T, G, C, A, sequencing reactions
using the indicated dideoxynucleotide. B and D, RNA secondary structure models of the
5’-terminal 80-nt genome regions of HCoV-229E and HCoV-NL63. SL1 and SL2 structures
are indicated and the TRS-L sequence is highlighted as a gray box. Black arrowheads
indicate positions of DMS modifications that were reproducible in repeated experiments.
RNA structures were predicted by RNAfold. The color code indicates base-pairing
probabilities calculated with RNAfold.

Furthermore, the DMS structure probing data obtained for HCoV- 
NL63 (and, to a slightly lesser extent, HCoV-229E) suggest that the TRS- 
L element located downstream of SL2 is part of an unstructured region. 
As shown in Fig. 1, nucleotides of the TRS-L core sequence and nu- 
cleotides adjacent to the TRS region were accessible to DMS mod- 
ification, confirming that they are part of single-stranded regions. In 
conclusion, our structure model is consistent with previous betacor- 
onavirus studies (Kang et al., 2006; Liu et al., 2007; Yang et al., 2015) 
and supports the idea that, in most coronaviruses, TRS-L is part of an 
unstructured region rather than a stable SL structure (Van Den Born 
et al., 2004; Dufour et al., 2011) (see also Discussion). 


4.2. Disruption of SL1 and SL2 by single-nucleotide substitutions causes 
major defects in HCoV-229E RNA synthesis and virus reproduction 


Having established the existence of two conserved RNA structural 
elements in the 5’-leader regions of HCoV-229E and HCoV-NL63 (this 
study and Madhugiri et al., 2014), we sought to investigate the func- 
tional significance of these elements in alphacoronavirus RNA synthesis 
using a reverse genetics system developed for HCoV-229E (Thiel et al., 
2001). To this end, HCoV-229E genome-length RNAs containing ap- 
propriate nucleotide substitutions in SL1 and SL2, respectively, were 
generated and used to investigate possible effects on viral replication in 
cell culture (for details, see Material and Methods). Using RNAfold 
(Lorenz et al., 2011), we predicted the most probable structures for 
RNAs containing specific nucleotide substitutions in the HCoV-229E 5'- 
terminal genome region and, based on these predictions, designed a set 
of mutations to be introduced in the HCoV-229E genome RNA for 
subsequent cell culture studies. The first set of mutants (HCoV- 
229E_C11G, _C16G, _G45C, and _C47G) contained single-nucleotide 
substitutions predicted to disrupt specific base-pair interactions in SL1 
or SL2, resulting in a destabilization (or restructuring) of the respective 
secondary structures. Another set of mutants (HCoV-229E_C11G+G34C,
_C16G+G29C, _G45C+C55G, and _C47G+G53C)
contained a second, compensatory mutation that restored the respective 
base-pairing interaction and, thus, preserved the stability of the stem 
structure (Fig. 2, panels C, E, G, and I). 
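
The logic of the single and compensatory substitutions can be sketched in a few lines of Python. The helper names and the toy hairpin sequence are hypothetical (the sequence is not the HCoV-229E 5'-terminal sequence), and only Watson-Crick pairs are considered, which is a simplification because G-U wobble pairs are ignored.

```python
import re

# Watson-Crick pairs only (simplifying assumption; G-U wobble is ignored).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def parse_substitution(label):
    """Parse a label such as 'C11G' into (old base, 1-based position, new base)."""
    match = re.fullmatch(r"([ACGU])(\d+)([ACGU])", label)
    if match is None:
        raise ValueError(f"cannot parse substitution label: {label}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_substitutions(sequence, labels):
    """Return the sequence with all substitutions applied, checking old bases."""
    bases = list(sequence)
    for label in labels:
        old, position, new = parse_substitution(label)
        if bases[position - 1] != old:
            raise ValueError(f"{label}: position {position} is not {old}")
        bases[position - 1] = new
    return "".join(bases)


def can_pair(sequence, i, j):
    """True if the nucleotides at 1-based positions i and j can form a WC pair."""
    return (sequence[i - 1], sequence[j - 1]) in WATSON_CRICK


# Toy hairpin (hypothetical) in which position 2 pairs with position 12.
wildtype = "GGACCUUGUGUCC"
single = apply_substitutions(wildtype, ["G2C"])
double = apply_substitutions(wildtype, ["G2C", "C12G"])
print(can_pair(wildtype, 2, 12), can_pair(single, 2, 12), can_pair(double, 2, 12))
```

In the toy example, the single substitution abolishes the base pair and the second-site substitution restores it with an inverted pair, which mirrors the design of the mutant pairs described above.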

For the C11G mutation, computer-assisted RNA structure analyses 
predicted a partial destabilization, resulting in a larger bulge in the 
middle of SL1 and, accordingly, a reduction of the calculated minimal 
free energy (Fig. 2A and B). This structural change is also reflected by 
reduced base-pair probabilities in SL1 (compare color codes in Fig. 2A 
and B, Suppl. Figure 2B). Restoration of the base-pair interaction be- 
tween nt 11 and 34 in the HCoV-229E_C11G+G34C mutant was pre- 
dicted to preserve the wildtype RNA structure (Fig. 2A and C). For the 
C16G mutation, the calculation of base-pair probabilities (Fig. 2D and 
Suppl. Figure 2) predicted a profound destabilization of the upper
segment of SL1 but also changes in adjacent regions (Fig. 2D and Suppl.
Figure 2). Again, introduction of an additional compensatory mutation
(G29C) was predicted to preserve the wildtype structure (Fig. 2E). For
the G45C mutation, drastic structural changes were predicted for SL2,
including a significantly less stable (4-bp) stem structure and a smaller
loop size (3 instead of 5 nts) (Fig. 2F). Even more profound effects were
predicted for the C47G replacement, resulting in a complete
destabilization of the SL2 stem structure (Fig. 2H, Suppl. Figure 2).
Taken together, these RNA structure predictions suggested that the
single-nucleotide and two-nucleotide substitutions introduced in the
HCoV-229E genome RNA were suitable to study possible roles of the
HCoV-229E SL1 and SL2 structures in viral replication. The predictions
also confirmed that the second-site (compensatory) substitutions were
suitable to preserve (near-) wildtype structures for both SL1 and SL2.

Fig. 2. RNA secondary structure predictions for HCoV-229E 5'-terminal
genome sequences containing nucleotide substitutions. For each mutant,
positions of substituted nucleotide(s) in the HCoV-229E genome are
indicated (with numbers in boldface). SL1 and SL2 structures are
indicated and the TRS-L sequence is highlighted as a gray box.
Nucleotides are numbered according to the wildtype sequence, starting
from the first nucleotide at the 5’-end of the genome. RNA secondary
structures were predicted by RNAfold. The color code represents the
base-pairing probability of the individual bases predicted by RNAfold.
Dot plots for each structure are provided in the supplement.

Fig. 3. Analysis of viral RNA accumulation and virus titers of HCoV-229E SL1 and SL2
mutants. Recombinant HCoV-229E (WT) and HCoV-229E SL1 and SL2 mutants were
generated by co-transfecting Huh-7 cells with the appropriate (mutant or wildtype) in
vitro-transcribed genome-length HCoV-229E RNA and HCoV-229E N mRNA as described
in Material and Methods. At 72 h p.t., the virus titer in the cell culture supernatant was
determined and viral RNA was analyzed by Northern blotting. A, Northern blot analysis of
HCoV-229E-specific RNAs produced in cells transfected with the indicated HCoV-229E
full-length RNAs. Virus-specific RNAs were detected using a [32P]-labeled DNA probe
specific for the HCoV-229E 3’-UTR (nts 26857-27277). B, Virus titers of HCoV-229E
wildtype (WT) and mutants were determined by end-point dilution using Huh-7 cells.
Virus titers (means ± SEM) are represented as TCID50/ml and were determined from
three independent transfection experiments.

Fig. 4. Sequence analysis of the SL2_C47G mutant. A and C, Sequence analysis of RT-PCR
products obtained from Huh-7 cells transfected with full-length HCoV-229E_C47G RNA at
passage 0 (at 72 h p.t.) and after serial passaging of the recombinant virus (passage 5). The
position of the nucleotide substitution and reversion is indicated by an arrow. B, Serial
passaging of the HCoV-229E_C47G mutant.

To study the effects of the structural changes caused by the
nucleotide substitutions, in vitro transcribed full-length HCoV-229E
RNAs (wildtype and mutants, respectively) and N mRNA were
co-transfected into 90% confluent Huh-7 cells. At 72 h p.t., cell culture
supernatants were collected to determine virus titers and intracellular
RNA was isolated for Northern blot analysis of viral RNA replication.
Using a [32P]-labeled probe specific for the 3’ end of the genome, we
analyzed the full set of 3'-coterminal genomic and subgenomic HCoV- 
229E RNAs. We found that, except for mutant C11G (see below), the 
SL1 and SL2 single-nucleotide mutants displayed severe defects in viral 
RNA accumulation, suggesting that the structural integrity of SL1 and 
SL2 is essential for viral replication. In the case of C11G, only minor 
defects in viral RNA accumulation were observed, suggesting that the 
stability of the basal part of SL1 is less critical, with some structural 
flexibility being tolerated in this case. However, our observation that 
the double mutant (C11G+G34C) with a fully preserved SL1 structure
replicates more efficiently than the C11G mutant shows that an intact
basal part of SL1 is beneficial (though not essential) for virus replication
(Fig. 2C, Fig. 3A, lane 6). Interestingly, similar observations were also
reported for MHV. If the lower part of the
(presumably equivalent) MHV SL1 structure was disrupted, infectious
virus progeny could still be recovered, while disruption of the upper 
part proved to be lethal (Li et al., 2008). For the HCoV-229E_C16G 
mutant, a major replication defect was observed which could be re- 
versed (albeit not completely) by restoring the base-pair interaction 
between nts 16 and 29 in SL1 (C16G+G29C) (Fig. 3A, lane 7). For the 
G45C and C47G mutations in SL2, we found that both mutations cause 
major defects in RNA replication (Fig. 3A, lanes 4 and 5). The corre- 
sponding double mutant HCoV-229E_G45C+C55G replicated with 
near-wildtype efficiency, while viral RNA accumulation was not fully 
restored in the double mutant C47G+G53C, suggesting additional 
constraints. Taken together, the mutagenesis study shows that the 
structural integrity of SL2 is essential for efficient HCoV-229E replica- 
tion. Similar observations were previously made for the betacor- 
onavirus MHV, where nucleotide substitutions that destabilized the SL2 
stem region resulted in a drastic reduction of viral RNA synthesis and 
production of infectious virus progeny (Liu et al., 2007). As mentioned 
above, all double mutants replicated more efficiently than their single- 
mutation counterparts (Fig. 3A, lanes 2-5 and 6-9), providing strong 
evidence for the existence and functional relevance of the HCoV-229E
SL1 and SL2 structures.

Fig. 5. Conservation of 5'-terminal SL1 and SL2 structures in alpha- and betacoronaviruses. A, Secondary structure prediction of the 5’-terminal genome regions of 4 coronaviruses
representing the genera Alphacoronavirus (HCoV-229E, HCoV-NL63) and Betacoronavirus (SARS-CoV, BCoV). The alignment was calculated by LocARNA and the structure by RNAalifold.
The consensus sequence is represented using the IUPAC code. Colors are used to indicate conserved base pairs: from red (conservation of only one base pair type) to purple (all six base
pair types are found); from dark (all sequences contain this base pair) to light colors (1 or 2 sequences are unable to form this base pair). The gray bars below the alignment indicate the
extent of sequence conservation at a given position. Gray shadows are used to link RNA structures with the corresponding dot-bracket notations above the alignment. To refine the
alignment, an anchor at the highly conserved SL2 was used. B, Individual SL2 structures were predicted by RNAfold. Nucleotides in gray boxes indicate the conserved loop sequence.
Colors represent base-pairing probabilities calculated by RNAfold. C, WebLogo representation of the conserved loop sequence of SL2 (Crooks et al., 2004). For the structure-based
alignment and WebLogo representation, sequences from human coronavirus (HCoV) 229E (NC_002645), HCoV-NL63 (isolate Amsterdam 1, NC_005831), bovine coronavirus (isolate
BCoV-ENT, NC_003045), and severe acute respiratory syndrome coronavirus (SARS-CoV, strain Tor2, NC_004718) were used.


Previous betacoronavirus studies suggested that, with few excep- 
tions, preservation of the SL1 and SL2 secondary structures is more 
important for viral replication than preservation of a specific nucleotide 


sequence (Liu et al., 2007; Li et al., 2008). By and large, our functional 
analysis of the HCoV-229E SL1 and SL2 elements supports these earlier 
proposals of the Leibowitz and Giedroc laboratories for the SL1 and SL2 
equivalents in MHV. For the HCoV-229E_C11G+G34C and 
Fig. 6. Characterization of a recombinant HCoV-229E mutant carrying a replacement of 
the SL1 structure. Virus titers in the supernatants of Huh-7 cells transfected with re- 
combinant HCoV-229E wildtype RNA (WT) or HCoV-229E RNA carrying a replacement of 
the cognate HCoV-229E SL1 element with the structural counterpart from HCoV-NL63 
(SL1-NL63) were determined as described in Material and Methods. Virus titers 
(means ± SEM) are represented as TCID50/ml and were determined from three in- 
dependent transfection experiments. 


G45C+C55G mutants, we were able to show that RNA synthesis re- 
turned to wildtype levels if the base-pairing potential was restored by 
introducing appropriate compensatory mutations (Fig. 3A, lanes 6 and 
8). In contrast, genome replication, sg mRNA synthesis and virus 
progeny production (see below) did not revert to wildtype levels in the 
case of C16G + G29C and C47G + G53C even though the stem structures 
were restored in these mutants (Fig. 3A, lanes 7 and 9). Our results 
strongly suggest that not only the SL1 and SL2 structures but also the 
nucleotide sequence plays an important role in RNA synthesis and the 
production of infectious virus progeny (see below). 
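
The compensatory-mutation logic behind these single and double mutants can be sketched in a few lines of Python. The sketch is illustrative only and is not part of the original study: the helper names (`apply_mutations`, `pair_status`) are hypothetical, and the wildtype nucleotides and pairing partners are inferred solely from the mutant names given in the text (C11-G34 and C16-G29 in SL1; G45-C55 and C47-G53 in SL2). It tests Watson-Crick complementarity only and therefore cannot capture the sequence-specific effects seen for C16G+G29C and C47G+G53C.

```python
# Wildtype stem nucleotides inferred from the mutant names in the text
# (position -> nucleotide); an illustrative subset, not a full structure.
WILDTYPE = {11: "C", 34: "G", 16: "C", 29: "G", 45: "G", 55: "C", 47: "C", 53: "G"}
PAIRS = [(11, 34), (16, 29), (45, 55), (47, 53)]
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def apply_mutations(name):
    """Return a position -> nucleotide map for a mutant such as 'C47G+G53C'."""
    bases = dict(WILDTYPE)
    for token in name.replace(" ", "").split("+"):
        old, pos, new = token[0], int(token[1:-1]), token[-1]
        if bases.get(pos) != old:
            raise ValueError(f"{token}: wildtype base at {pos} is not {old}")
        bases[pos] = new
    return bases


def pair_status(name):
    """Report, for each stem pair, whether Watson-Crick pairing is possible."""
    bases = apply_mutations(name)
    return {pair: (bases[pair[0]], bases[pair[1]]) in WATSON_CRICK for pair in PAIRS}


if __name__ == "__main__":
    for mutant in ["C47G", "C47G+G53C", "G45C", "G45C+C55G"]:
        broken = [p for p, ok in pair_status(mutant).items() if not ok]
        print(mutant, "disrupted pairs:", broken)
```

For a single substitution such as C47G the check flags the 47-53 pair as disrupted, whereas the double mutant C47G+G53C passes it. The incomplete recovery of that double mutant in the experiments is precisely what a pairing-only check cannot explain.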

Along with the analysis of viral RNA accumulation, we measured 
virus titers in the supernatants of transfected cells (Fig. 3B). Titers are 
given as mean values and standard error of the mean (± SEM) and 
were determined from three independent transfection experiments. In 
all SL1 and SL2 mutants, transfection of full-length (wildtype or mu- 
tant) genome RNA gave rise to infectious virus progeny, but virus titers 
varied greatly among the different mutants (Fig. 3B). Substitution of 
C11 with G resulted in an approximately 10-fold reduced titer com- 
pared to the wildtype virus, consistent with the moderate reduction in 
viral RNA synthesis observed for this mutant (Fig. 3A, lane 2). In con- 
trast, the C16G, G45C, and C47G substitutions caused a drastic 
(100-1000-fold) reduction of virus titers, suggesting severe defects in 
viral replication. Upon restoration of the base pairing in the 
SL1_C11G+G34C and SL2_G45C+C55G mutants, the production 
of infectious virus progeny returned to (near) wildtype levels. Con- 
sistent with the Northern blot data presented above, restoration of base- 
pairing interactions in the C16G+G29C and C47G+G53C mutants 
failed to restore the full replication potential. Overall, the titers ob- 
tained for the HCoV-229E SL1 and SL2 mutants correlate very well with 
the RNA replication data (Fig. 3A), suggesting that the introduced 
mutations primarily affect viral RNA synthesis rather than a late step in 
the viral life cycle. 
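
The way titers are summarized here (mean ± SEM from three independent transfections, and fold reduction relative to wildtype) can be written out as a short calculation. The following is a minimal sketch under stated assumptions: the function names are hypothetical and the numbers are placeholders chosen for easy arithmetic, not measurements from this study.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for replicate measurements."""
    if len(values) < 2:
        raise ValueError("need at least two replicates to estimate the SEM")
    mean = statistics.mean(values)
    # SEM = sample standard deviation / sqrt(number of replicates)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def fold_reduction(wildtype_mean, mutant_mean):
    """Fold reduction of a mutant titer relative to the wildtype titer."""
    return wildtype_mean / mutant_mean


# Placeholder titers in TCID50/ml (three replicates each); NOT data from the study.
wildtype_titers = [1.0e6, 2.0e6, 3.0e6]
mutant_titers = [1.0e5, 2.0e5, 3.0e5]

wt_mean, wt_sem = mean_sem(wildtype_titers)
mut_mean, mut_sem = mean_sem(mutant_titers)
print(f"wildtype: {wt_mean:.2e} ± {wt_sem:.2e} TCID50/ml")
print(f"mutant:   {mut_mean:.2e} ± {mut_sem:.2e} TCID50/ml")
print(f"fold reduction: {fold_reduction(wt_mean, mut_mean):.1f}")
```

Because titers span several orders of magnitude, averaging log10-transformed values is a common alternative; the text does not state which convention was used.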


4.3. Rapid reversion of the C47G substitution to the wildtype sequence 


As illustrated in Fig. 2, single-nucleotide substitutions in the stem of 
SL2 were predicted to cause major structural rearrangements and, 
consistent with the presumed cis-acting function of this element, re- 
sulted in severe defects in viral RNA synthesis and reproduction 
(Fig. 3). To test if the introduced mutations were stable enough to allow 
their phenotypes to be analyzed in passage 0 (p0), we subjected viral 
RNAs isolated from p0 virus stocks of the SL1 and SL2 mutants to 
Fig. 7. Characterization of recombinant HCoV-229E mutants carrying replacements of 
the SL2 structure. A, Virus titers of HCoV-229E wildtype (WT) and HCoV-229E mutants 
carrying SL2 elements from BCoV (SL2-BCoV) and SARS-CoV (SL2-SARS). Cell culture 
supernatants were collected at 72 h p.t. and titers were determined as described in 
Material and Methods. B, Recombinant HCoV-229E (WT) and HCoV-229E mutants car- 
rying SL2 structures from BCoV or SARS-CoV (SL2-BCoV, SL2-SARS-CoV) were serially 
passaged and virus titers were determined for passage 6 (p6) virus stocks using cell 
culture supernatants collected at 48 h p.i. Virus titers (means ± SEM) are represented as 
TCID50/ml and were determined from three independent transfection (p0) and infection 
(p6) experiments, respectively. C, Plaque sizes of the indicated recombinant viruses. D, 
Sequence analysis of the genome region containing the indicated SL2 replacement in the 
HCoV-229E genome using RNA isolated at passage 0 (72 h p.t.). Positions of wildtype and 
betacoronavirus-derived SL2 sequences are indicated below the chromatograms. 
Fig. 8. Predicted RNA secondary structure of an HCoV-229E SL2 element carrying two A/U 
to G/C replacements. Positions of nucleotide substitutions are indicated with numbers 
(in boldface) in the SL2-mut structure. RNA secondary structures were predicted by 
RNAfold. Colors represent the base-pairing probability calculated by RNAfold. 


partial genome sequence analyses covering the entire 5'-UTR and 3'- 
UTR regions. In all cases, the introduced mutations were found to be 
retained, suggesting that the virus titration and Northern blot data 
shown in Fig. 3 reflect ‘true’ phenotypes of the SL1 and SL2 mutants 
generated in this study. Only in one case, SL2_C47G, did we obtain evi- 
dence for a rapid reversion back to the wildtype sequence. As shown in 
Fig. 4A, we observed an additional C peak at position 47. To corrobo- 
rate this observation, the C47G mutant was subjected to 5 serial pas- 
sages in Huh-7 cells (Fig. 4B) and RNA isolated from virus-infected cells 
was used for sequence analysis. The sequence data confirmed a com- 
plete replacement of the C47-to-G mutation with the wildtype nucleo- 
tide C47, thereby restoring the Watson-Crick base pairing of nucleotides 
47 and 53 in this revertant (Fig. 4C). The data lead us to suggest a 
critical role for the C47-G53 base pair in supporting specific SL2 
structure-function relationships required for coronavirus RNA synth- 
esis. The data also suggest that the (low) titer determined for the C47G 
p0 virus stock may represent an overestimate of the real replication 
efficiency of the C47G mutant because, even at this early time point 
p.t., a significant proportion of virus genomes had reverted to the 
wildtype sequence. We did not observe reversions or second-site sub- 
stitutions in the SL2-C47G + G53C double mutant, where Watson-Crick 
base pairing was restored (Fig. 2), suggesting that RNA synthesis and 
viral reproduction, although being reduced compared to the wildtype, 
were sufficient for virus growth in cell culture. Nevertheless, the partial 
growth defect of this double mutant indicates that not only the helical 
stem structure but also the sequence might play a role in viral RNA 
synthesis (Fig. 3). 


4.4. HCoV-229E SL2 is functionally exchangeable with betacoronavirus 
SL2 elements 


Genus-wide consensus secondary structural models indicated that 
the 5’-terminal ~150 nt of alpha- and betacoronaviruses folds into 
three highly conserved structures that are generally referred to as SL1, 
SL2, and SL4 (Madhugiri et al., 2014, 2016; Yang and Leibowitz, 2015). 
This conservation is illustrated in Fig. 5A for 4 coronaviruses re- 
presenting the genera Alpha- and Betacoronavirus. Previous studies 
suggested that SL2 represents the most conserved RNA structural 
element in coronavirus genomes (Kang et al., 2006; Chen and 
Olsthoorn, 2010). As illustrated for 4 representative coronaviruses in 
Fig. 5B, SL2 is always comprised of a 5-base-pair helical stem and, in 
most cases, a pentaloop structure. The loop sequence (5’-(C/U)UUG(U/C)-3’) 
is highly conserved while the stem sequence is variable 
(Fig. 5C). Previous studies in betacoronavirus systems revealed that 
intra-genus replacements of 5’-terminal RNA structural elements may 
result in viable viruses (Kang et al., 2006), supporting the high degree 
of both structural and functional conservation among betacoronavirus 
cis-acting elements in the 5' genome region. These earlier and our own 
studies led us to suggest that several RNA structural elements in the 5' 
and 3' UTRs are not only conserved among alphacoronaviruses but also 
across different genera of the Coronavirinae (Madhugiri et al., 2014). To 
test this hypothesis, we constructed a mutant in which the HCoV-229E 
SL1 was replaced with the equivalent structure of HCoV-NL63. Trans- 
fection of in vitro-transcribed HCoV-229E genome RNA containing this 
intra-genus replacement gave rise to infectious virus progeny that re- 
plicated to near-wildtype titers (Fig. 6), confirming the functional 
conservation of this structure among alphacoronaviruses. As the SL2 
sequences of HCoV-229E and HCoV-NL63 are identical, a replacement 
of the SL2 structures was dispensable. Instead, we decided to study the 
extent of SL2 conservation across genus boundaries by extending our 
studies to the SL2 elements of betacoronaviruses. To our knowledge, 
inter-genus exchanges of cis-acting elements in coronavirus 5'-proximal 
genome regions have not been performed previously. Using the vaccinia 
virus-based reverse genetics system, we replaced the HCoV-229E SL2 
structure in the full-length HCoV-229E cDNA sequence with the struc- 
tural counterpart from BCoV and SARS-CoV, respectively, representing 
different lineages of the genus Betacoronavirus. In vitro-transcribed full- 
length ‘chimeric’ and wildtype HCoV-229E genome RNAs were trans- 
fected into Huh-7 cells. At 72h p.t., virus titers in cell culture super- 
natants collected from transfected cells were determined from three 
independent transfection experiments. In all cases, we were able to 
recover infectious virus progeny (Fig. 7). Exchange of the HCoV-229E 
SL2 with that of BCoV resulted in a viral titer of > 10° TCID50/ml 
(Fig. 7A), while a replacement with the SL2 of SARS-CoV resulted in 
significantly lower titers (Fig. 7A) and slightly smaller plaque sizes 
compared to the parental HCoV-229E virus (Fig. 7C). Sequence analyses 
of cDNA obtained from the chimeric virus progeny confirmed that the 
exchanges introduced into the HCoV-229E genome were retained 
(Fig. 7D) and no additional second-site mutations were identified in the 
5’- and 3'-UTRs in the recovered viruses. Furthermore, we passaged the 
chimeric viruses ‘blindly’ six times. Titration of these serially pas- 
saged viruses revealed near-wildtype growth for the virus carrying the 
BCoV SL2, while the virus carrying the SARS-CoV structure replicated 
to slightly lower titers (Fig. 7B). The increase in titers might be due to 
the accumulation of compensatory mutations. To address this possibi- 
lity, viral cDNA produced from the p6 virus stocks of the BCoV-SL2 and 
SARS-SL2 mutants, respectively, were subjected to sequence analysis 
covering the entire 5'- and 3’-UTRs and the replicase gene sequences 
encoding nsp7 to 12. Analysis of the consensus sequence of the virus 
stocks confirmed that the introduced SL2 replacements were retained 
after six passages (data not shown) and no compensatory mutations 
were detected in this partial genome sequence analysis. To understand 
how these chimeric viruses evolved to reach near-wildtype titers 
(while retaining the engineered substitutions), complete sequence 
analyses of single plaque-purified (high-passage) chimeric viruses re- 
main to be performed in future studies. Taken together, these inter- 
genus exchange data provide experimental proof for our hypothesis that 
several coronavirus cis-acting RNA elements are conserved, both 
structurally and functionally, among different coronavirus genera. 
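
The conserved SL2 loop motif given above, 5’-(C/U)UUG(U/C)-3’, translates directly into a simple pattern match. The sketch below is an illustration only: the function name is hypothetical, and the example loops are strings enumerated from the consensus itself rather than sequences taken from any particular virus genome.

```python
import re

# 5'-(C/U)UUG(U/C)-3' as a regular expression (C/U corresponds to IUPAC code Y).
SL2_LOOP = re.compile(r"[CU]UUG[UC]")


def matches_loop_consensus(loop):
    """True if a 5-nt loop sequence fits the consensus exactly.

    DNA-style input is accepted by converting T to U before matching.
    """
    return SL2_LOOP.fullmatch(loop.upper().replace("T", "U")) is not None


if __name__ == "__main__":
    for candidate in ["CUUGU", "UUUGC", "cttgt", "CUAGU", "CUUGUA"]:
        print(candidate, matches_loop_consensus(candidate))
```

Using `fullmatch` rather than `search` keeps the check strict: a six-nucleotide string that merely contains the motif is rejected, which mirrors the pentaloop constraint.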


4.5. Mutations predicted to increase the stability of SL2 are not tolerated 


As shown in Fig. 3, nucleotide substitutions predicted to destabilize 
SL2 cause major defects in HCoV-229E RNA synthesis and virus 
reproduction, demonstrating the functional relevance of this SL struc- 
ture. Interestingly, the helical SL2 stem of most coronaviruses is com- 
posed of three A-U and two G-C base pairs (Fig. 5A and B, 
Fig. 8A). We therefore asked whether this pattern of 
base-pair interactions reflects a finely balanced stability of this struc- 
ture. To address this question, we stabilized the SL2 structure by in- 
troducing 4 mutations. The mutations replaced two A/U base pairs at 
positions 44 + 56 and 46 + 54 with G/C base pairs and were predicted to 
decrease the minimum free energy of this secondary structure (Fig. 8). 
An in vitro-transcribed full-length genome RNA containing this set of 
mutations and a wildtype control RNA, respectively, were transfected 
into Huh-7 cells and virus titers in cell culture supernatants collected at 
72h p.t. were determined. In repeated experiments, we failed to re- 
cover viable virus containing these ‘stabilizing’ mutations in the stem 
region of SL2, while the wild-type virus was readily recovered with high 
titers. The data demonstrate specific sequence requirements for the SL2 
stem region. It remains to be studied in further experiments if these 
sequence constraints reflect a requirement for an ‘optimal stability’ of 
SL2 or rather the presence of specific nucleotides at specific positions. 
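
The change in stem composition introduced by these ‘stabilizing’ mutations can be made explicit with a small worked calculation. This is a deliberately crude sketch: it counts canonical Watson-Crick hydrogen bonds (two per A-U pair, three per G-C pair) as a rough proxy for stem strength and is not the minimum free energy calculation performed with RNAfold in the study. The function name is hypothetical; the base-pair counts follow the text (three A-U and two G-C pairs in the wildtype stem, with two A-U pairs replaced by G-C pairs in the mutant).

```python
# Canonical Watson-Crick hydrogen bonds per base pair (a crude stability proxy;
# real predictions use nearest-neighbor free energies, e.g. as in RNAfold).
HBONDS = {"AU": 2, "GC": 3}


def stem_hbonds(n_au, n_gc):
    """Total number of Watson-Crick hydrogen bonds in a stem."""
    return n_au * HBONDS["AU"] + n_gc * HBONDS["GC"]


# Wildtype SL2 stem: three A-U and two G-C pairs (from the text).
wildtype = stem_hbonds(n_au=3, n_gc=2)
# 'Stabilized' mutant: two A-U pairs replaced by G-C pairs.
stabilized = stem_hbonds(n_au=1, n_gc=4)

print("wildtype stem hydrogen bonds:  ", wildtype)
print("stabilized stem hydrogen bonds:", stabilized)
print("difference:", stabilized - wildtype)
```

The count only conveys the direction of the change; whether the lethal phenotype reflects excess stability or the loss of specific nucleotides cannot be decided from it.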


5. Discussion 


In this study, we used a combined bioinformatics, biochemical and 
reverse genetics approach to characterize the structures and functions 
of the putative cis-acting SL1 and SL2 elements of the alphacoronavirus 
HCoV-229E (Madhugiri et al., 2014, 2016; Yang and Leibowitz, 2015). 
Based on RNA structure probing information obtained for in vitro- 
transcribed RNAs representing appropriate genome sequences of HCoV- 
229E and a second alphacoronavirus (HCoV-NL63), combined with 
bioinformatics studies, we present a robust RNA secondary structure 
model for the 5’-terminal 80 nts of the HCoV-229E genome that we 
consider representative of other alphacoronaviruses. We also pro- 
vide evidence that (i) the SL1 and SL2 RNA structural elements are 
required for viral replication and (ii) the structures and functions of 
these elements are conserved among alphacoronaviruses (and, prob- 
ably, betacoronaviruses). The study revealed a number of interesting 
parallels to the SL1 and SL2 elements of betacoronaviruses which have 
been characterized extensively in previous studies and shown to be 
required for BCoV and MHV genome replication and sg mRNA synthesis 
(Kang et al., 2006; Liu et al., 2007, 2009a; Li et al., 2008). 

The RNA structure probing data presented in this study (Fig. 1) are 
consistent with models developed previously for a range of beta- and, to 
a lesser extent, alphacoronaviruses (Kang et al., 2006; Liu et al., 2007, 
2009a; Li et al., 2008; Madhugiri et al., 2014; Yang et al., 2015). The 
probing information provides experimental support for the presence of 
stable SL1 and SL2 structures in the 5'-terminal genome region and 
suggests that the TRS-L, together with flanking sequences, is part of an 
unstructured region. The latter conclusion is consistent with previous 
studies in which the TRS-L was proposed to be located in an un- 
structured region in the majority of coronavirus genomes (Liu et al., 
2007; Madhugiri et al., 2014; Yang et al., 2015). Furthermore, most 
structure prediction applications place (or can be forced to place) the 
coronavirus TRS-L in SL structures that would only be supported by two 
conserved base pairs, arguing against a major role of such a structure 
(Raman et al., 2003; Liu et al., 2007; Chen and Olsthoorn, 2010; Yang 
et al., 2015; Madhugiri et al., 2016). In this context, it should be noted 
that there is also evidence that, in a subset of alphacoronaviruses 
(TGEV), betacoronaviruses (including BCoV) and gammacoronaviruses, 
an additional structural element (SL3, also called SL-II in several BCoV 
studies) may exist. For example, studies on the TRS-L element of TGEV 
using NMR spectroscopy, UV thermal denaturation experiments and a 
reverse genetics approach suggested the existence of a defined hairpin 
structure in this genome region (Dufour et al., 2011). Most of the TGEV 
TRS-L core sequence was proposed to be located in a heptaloop region 
of a hairpin structure with moderate (possibly optimized) thermal sta- 
bility. Both the structure and stability of this TRS-L hairpin structure 
were shown to play a role in TGEV replication and transcription where it 
was proposed to act as a landing platform for the nascent minus-strand 
RNA, similar to the similarity-assisted RNA recombination model pro- 
posed earlier by Nagy and Simon (1997). Taken together, 
the available information suggests that the TRS-L region may be 
structurally flexible and adopt alternative structures to regulate specific 
steps of viral RNA synthesis. 

The HCoV-229E mutagenesis data obtained in our reverse genetics 
study of SL1 and SL2 mutants provide experimental support for the 
functional relevance of these elements in viral replication. Based on 
computer-assisted structure predictions for SL1 and SL2 variants con- 
taining specific mutations in stem regions, a set of mutants was de- 
signed in which the respective structures were destabilized or dis- 
rupted. Possible effects of the mutations on viral replication were 
subsequently studied in cell culture. The mutagenesis data obtained for 
the SL1 mutants confirm a critical role for SL1 in HCoV-229E replica- 
tion. The data also revealed that destabilizing mutations in the upper 
and lower parts of the SL1 structure have quite different effects on viral 
replication. Furthermore, the incomplete restoration of the in vitro 
growth characteristics of the C16G+G29C mutant suggests additional 
(sequence) constraints which remain to be investigated in further stu- 
dies. The less critical role observed for C11 (which acts to stabilize the 
lower part of SL1) in viral replication and the observation that the 
double mutant C11G+G34C replicated with near-wildtype character- 
istics suggest that the lower part of SL1 tolerates some structural 
changes, possibly indicating flexibility in this part of the structure. 
These observations are reminiscent of data reported by the Giedroc 
laboratory for MHV (Li et al., 2008). In this case, the upper part of the 
SL1 stem was found to be required for efficient MHV replication while a 
less stable structure of the lower part was largely tolerated. Interest- 
ingly, the MHV study also detected second-site suppressor mutations in 
the 5’- and 3’-UTRs in some of the SL1 mutants. Based on these second- 
site mutations, a “dynamic SL1” model was proposed in which the 
lower part of SL1 is required to have an optimized flexibility to mediate 
physical interactions between the 5’- and 3’-UTRs that, for example, 
may stimulate sg mRNA synthesis. To date, we have failed to detect any 
second-site suppressor mutations in the 5’- and 3’-UTRs of HCoV-229E 
in our SL1 mutants. Possible reasons for this discrepancy from the MHV 
data remain to be studied but may relate to the more drastic (deletion) 
mutations introduced in the MHV SL1 structure, which may have forced 
the development and fixation of compensatory mutations in the MHV 
mutants (Li et al., 2008) while, in our own study, single-nucleotide 
substitutions were introduced in the HCoV-229E SL1. 

As mentioned above, SL2 represents the most conserved cis-acting 
RNA element in coronaviruses (Kang et al., 2006; Chen and Olsthoorn, 
2010), suggesting an important function in coronavirus replication. Our 
HCoV-229E SL2 mutagenesis data strongly support this hypothesis. 
Thus, any disruption of G-C base pair interactions predicted to desta- 
bilize the HCoV-229E SL2 stem structure (Fig. 2F and H) caused major 
defects in viral replication (Fig. 3) while restoration of the helical stem 
in the double mutant G45C+C55G resulted in a wildtype phenotype 
(Fig. 3). Surprisingly, the other double mutant (C47G + G53C) that was 
predicted to preserve the SL2 stem structure (Fig. 2, panel I) was found 
to have partial defects in RNA replication and production of infectious 
virus progeny (Fig. 3). Furthermore, the rapid reversion of the C47G 
mutant to the wildtype sequence (Fig. 4) indicates a strong selection 
pressure to maintain this particular base pair interaction. While our 
mutagenesis data, combined with extensive MHV SL2 mutagenesis 
studies (Kang et al., 2006; Liu et al., 2007), establish an essential role 
for SL2 in alpha- and betacoronavirus replication, more studies will be 
required to investigate the precise roles of residues in the HCoV-229E 
SL2 stem and loop regions, including base pair interactions within the 
loop or at its base (C47), and, possibly, also unravel the special role of 
the C47-G53 pair in the function of SL2. 

Based on our own and other studies (see below), it was tempting to 
suggest that the (structurally) conserved 5’-proximal cis-acting RNA 
elements, including SL1 and SL2, may also be functionally conserved 
across coronavirus genera. To test this idea, we constructed HCoV-229E 
mutants in which the cognate SL2 element was replaced with that of 
BCoV and SARS-CoV, respectively. Our study was guided by earlier 
bioinformatic analyses (Madhugiri et al., 2014) that suggested con- 
servation of RNA secondary structures in the UTRs of viruses from the 
same genus but also other coronavirus genera. Previously, such a 
structural and functional conservation had only been confirmed for 
members of the same genus (Goebel et al., 2004; Kang et al., 2006). In 
the present study, we were able to extend these previous conclusions to 
alphacoronaviruses by showing that a replacement of the HCoV-229E 
SL1 with the equivalent structure from HCoV-NL63 (Fig. 6) was toler- 
ated very well, with titers of the chimeric virus approaching that of the 
wildtype virus. More importantly, we were able to show that the SL2s of 
BCoV and SARS-CoV, respectively, can act, at least in part, as functional 
substitutes for the cognate SL2 structure in the HCoV-229E genome. To 
our knowledge, this is the first experimental proof that the 5’-terminal 
SL2 is both structurally and functionally conserved among alpha- and 
betacoronaviruses. In line with previous studies (Kang et al., 2006), the 
four coronaviruses included in the present study (representing the 
genera Alpha- and Betacoronavirus) share a short (4-5-bp) helical stem 
and a highly conserved pentaloop sequence, 5’-(U/C)UUGU-3’ (Fig. 5). 
Although the sequence of the predicted helical stems is not conserved 
(Fig. 5C), viable viruses carrying the BCoV or SARS-CoV SL2 counter- 
parts could be recovered as confirmed by virus titration and sequence 
analysis (Fig. 7A and D). We did not observe second-site mutations in 
the introduced SL2 structures and the entire 5’- and 3’-UTR regions in 
virus stocks collected at 72 h p.t. (p0) and after serial passaging (p6), 
respectively (Fig. 7). Complete genome analyses of plaque-purified 
mutants remain to be performed in future studies to exclude compen- 
satory mutations in other genome regions. Although several features 
are conserved among alpha- and betacoronavirus SL2 structures, which 
may explain the functionality of the betacoronavirus SL2 structure in an 
alphacoronavirus context, more studies are required to fully understand 
the critical parameters required for the SL2 function(s) in virus re- 
plication. One of these parameters might be an optimal stability of the 
4-5-bp stem of SL2. In this context, the observed lethal phenotype of an 
HCoV-229E mutant carrying an SL2 with two additional G-C base pairs 
(Fig. 8) provides preliminary evidence to suggest that both the nu- 
cleotide sequence and the stability of the stem may play a more im- 
portant role than previously thought and, thus, deserve further studies. 

Taken together, our functional characterization of alphacoronavirus 
SL1 and SL2 elements strongly supports the idea that, despite very 
limited sequence conservation, a number of cis-acting RNA elements 
including SL1 and SL2 are structurally conserved and have similar 
functions in coronavirus replication. 